Method for smoothing transitions between scenes of a stereo film and controlling or regulating a plurality of 3D cameras

ABSTRACT

In an exemplary embodiment, a method for producing a stereo film is provided, wherein a first image that is supplied ( 10 ) by a first camera rig having at least two cameras is followed ( 50 ) by a second image from a second camera rig, wherein furthermore a disparity table for definition of the displacement of a defined image point in a first sub-frame supplied by a first camera of the first camera rig relative to an image point similar thereto in a second sub-frame supplied by a second camera of the first camera rig is determined ( 20, 30 ) in order to obtain information about the depth of the first image composed of the first sub-frame and the second sub-frame, wherein the depth information of the disparity table of the first image of the first camera rig is used ( 60 ) for processing of the second image of the second camera rig. The invention also relates to controlling (means) or regulating means for a plurality of 3D cameras configured to carry out said method.

INCORPORATION BY REFERENCE TO ANY PRIORITY APPLICATIONS

Any and all applications for which a foreign or domestic priority claim is identified in the Application Data Sheet as filed with the present application are hereby incorporated by reference under 37 CFR 1.57.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a flowchart of a method for smoothing transitions between scenes of a stereo film and controlling or regulating a plurality of 3D cameras.

FIG. 2 shows a schematic view of a system for smoothing transitions between scenes of a stereo film and controlling or regulating a plurality of 3D cameras.

DETAILED DESCRIPTION

This application relates to a method for producing a stereo film, wherein a first image that is supplied by a first camera rig having at least two cameras is followed by a second image from a second camera rig, wherein furthermore a disparity table for the definition of the displacement of a defined image point in a first sub-frame supplied by a first camera of the first camera rig relative to an image point similar thereto in a second sub-frame supplied by a second camera of the first camera rig is determined in order to obtain information about the depth of the first image composed of the first sub-frame and the second sub-frame.

Camera rigs and methods for producing 3D films, or so-called stereo films, are known from prior art. Such films project a specific image for each eye of the viewer so that a three dimensional image is composed for the viewer.

Usually for the case of camera rigs used for the recording of scenes, two cameras are combined in each camera rig. While a first camera rig having two cameras is directed from a first viewing angle towards a scene to be filmed, a second camera having two further combined cameras is directed at a different viewing angle towards the scene. If now a “cut” is performed from the first camera rig to the second camera rig, i.e. if a sequence of images follows, which images are supplied by the first camera rig, said sequence being a sequence of images following the cut, which sequence is supplied by the other camera rig, then, in the case of three dimensional films, undesired effects often occur for the viewer due to the cut.

Thus it can happen that, when the first camera rig is directed towards a scene, a much larger impression of depth is achieved such that for example an object of the scene is perceived by the viewer to be far in front of a virtual plane of the screen, whereas upon cutting, the object is perceived as being far behind the plane of the screen, or at least not at the position observed from the other perspective a short while previously.

While no unpleasant effects occur in real life, if an object should suddenly come towards the viewer, this fast change in impression of depth during the reproduction of related information leads to discomfort for the viewer.

This is, inter alia, because in real life, for example in the case of observing a landscape, wherein an object, such as a ball spontaneously speeds towards a viewer, this ball is not in focus at first and thus the negative effects that occur when observing a film do not appear.

Therefore it is an object of some embodiments of the present invention to allow a cut to be made in a stereo film such that a first image that is supplied by a first camera rig having at least two cameras can be followed by a second image from a second camera rig, and the impression of depth created in both images does not cause unpleasant side effects for the viewer.

This object is solved according to some embodiments of the present invention in that the depth information of the disparity table of the first image of the first camera rig is used for the processing of the second image of the second camera rig.

A disparity table is understood to mean such a compilation of information that makes possible the assessment of the impression of depth of the first image. The impression of depth is created by means of an image depth analysis which can also be called lateral disparity or disparity. Such a disparity is an offset in the position which the same object in the picture occupies on two different image planes. The optical centers for the image planes of the lenses are in this way spatially separated from each other by the basis b. If both lenses have the focal length f,

$r = \frac{b \cdot f}{d}$ applies for distance r, wherein d stands for the disparity. This formula applies only to the stereo normal case, i.e. when the two cameras are aligned in parallel. If both cameras are slightly pivoted towards one another, i.e., convergently aligned, a modified formula is applicable.

One can therefore determine the distance r to an object by a measuring of the disparities in the stereo image. A disparity map or a disparity table of a stereo image is therefore synonymous with a depth map.

It should be noted here that an image is understood to mean the compilation of two sub-frames, wherein each of the two sub-frames is supplied by one of the two cameras of a defined camera rig.

Some embodiments of the invention, which can be implemented in a Stereoscopic Image Processor (SIP), can analyze a scene and provide metadata concerning the depth or depth information of a near and distant object, and also supply information regarding the total space/total volume in real-time. In addition the SIP can also perform an image processing and image manipulation. In some embodiments of the proposed invention it is achieved that, based on the provided data, it is ensured that 3D-changes within a scene remain within the depth budget.

Advantageous embodiments are claimed in the dependent claims and are explained in more detail below.

Thus it is advantageous when a second sub-frame is displaced relative to a first sub-frame supplied from a first camera of the two cameras, wherein said first sub-frame forms the second image together with the second sub-frame supplied from a second camera of the second camera rig. With a displacement of both sub-frames of the second camera rig relative to each other, the impression of depth is changed. Therefore the impression of depth can be adjusted to fit the impression of depth in the previously present first image.

When the second sub-frame is displaced horizontally, one can resort to a standard depth effect generation. If the second sub-frame, i.e. if for example a right sub-frame, is displaced from a left sub-frame towards the right, then the depth effect increases, whereas the depth effect is reduced for the reverse case. This is due to the lines of sight which run almost parallel to each other when observing a distant object, whereas in the case of a very near object, even an intersecting of the sightlines can occur. The disparities are arranged around a zero point, and can thus take negative as well as positive values.

It is further advantageous when the second sub-frame of the second camera rig is displaced in a displacement step by such a distance until the same disparity is present between the two sub-frames of the second image as between the two sub-frames of the first image. The viewer's eye in that case does not have to adjust and negative effects are almost completely removed.

The method can be further improved if the second image is magnified or reduced by means of a zoom setting with a correction step dependent on the disparity table of the first image, until the depth distance between a point in the foreground and a point in the background of the second image corresponds to the depth distance between these two points in the second image. By means of the change of the zoom setting, the perceived depth distance from a first object in the scene to a second object in the scene changes.

The disparities also diminish when the zoom setting operates only as a digital zoom and does not operate mechanically on the physical lenses of the second camera rig.

It is a further advantage here when the first and second sub-frame of the second image is magnified or reduced. With a reduction of the second image, the disparities also reduce linearly with the reduction of the image so that the negative side effects caused by disparities being too high when cutting do not appear to the user.

When the correction step is performed simultaneously with or following the displacing step, a positive result can be achieved particularly quickly in the first of the two cases, whereas a particularly exact result can be achieved in the second case.

It is further advantageous when the depth budget which is placed in the disparity table is applied to the second image such that all areas of the second image which lie beyond the depth budget are shown blurred. A depth budget is understood to mean the range caused by the disparities/the region caused by the disparities. Therefore when the smallest disparity is, for example, −50 and the largest disparity is 0, the image comprises a depth budget of −50 to 0.

In order to achieve a particularly efficient blurring, a Gaussian blurring algorithm is used to achieve the blurring in one or a plurality of regions of the second image. Therefore the areas are identified in which the depth budget of the second image is too large in comparison to the depth budget in the first image, and the identified areas are then displayed blurred. Under the Gaussian blurring algorithm, the surrounding pixels are used and the pixels which are to be displayed excessively blurred are recalculated according to a Gaussian normal distribution.

Some embodiments of invention also relate to a controlling or regulating of a plurality of 3D cameras, said 3D cameras being suited for the recording of a stereo film, wherein the controlling or regulating is configured such that it can perform the method according to the invention.

Some embodiments of the invention are also subsequently explained in more detail with the help of a picture. A first exemplary embodiment is visualized in a schematically depicted flowchart of a first figure (FIG. 1), wherein a second exemplary embodiment is visualized in a further figure (FIG. 2).

FIG. 1 shows a flowchart of a first exemplary embodiment of a method according to the invention:

In a first step 10, the recording of a first image takes place with a first camera rig comprising two cameras. In a subsequent second step 20, the disparities in the first image are ascertained.

In a subsequent third step 30, the setting up of a disparity table occurs, which can also be described as a disparity map.

In step 40, a cut takes place from the first camera rig to the second camera rig during the production of the film sequence of the stereo film. The second camera rig also contains two cameras.

In the subsequent step 50, the recording of a second image takes place with the second camera and its two cameras.

In the subsequent step 60, the use of the disparity table for the processing of the second image takes place.

In a sub-step 61, which is followed by a subsequent step 62 in the exemplary embodiment depicted here, the displacing of a sub-frame of the second image to another sub-frame of the second image takes place, wherein both of these two sub-frames form in total the second image. In the sub-step 62, a correction step is performed, in other words the zoom setting in the second image is changed. Thus in the displacing step 61, the disparity distribution in total is changed, whereas in the correction step 62 the present disparities per se are changed.

A blurring step 63 can also be performed in parallel to, subsequent to, or as an alternative to this. The blurring step comprises an identifying of areas of the second image which have too large or too small a disparity compared to the disparities of the first image. A blurring of these areas is realized, for example, by means of a Gaussian blurring algorithm.

A schematic construction of a second exemplary embodiment according to the invention is depicted in the second figure (FIG. 2):

Two cameras 110 and 120 are contained in each camera rig 100, which cameras send image data of a stereoscopic image pair to an image analysis unit 130. The image analysis unit 130 determines the scene depth in terms of near, middle and of a more remote area in real time.

These obtained metadata are either embedded in the image data of one or the other image of the stereoscopic image pair, or embedded in the image data of both images. These processed data are passed on to a switch 140, also identified as switcher.

The switch 140 allows a user to choose among the source data for an output interface 160, under the interposition of an image processor 150. The image processor 150 contains statistical depth budget parameters, in particular background data relating to a maximum allowable change per unit of time.

The image processor 150 manages a dynamic statistic from the depth information obtained from the metadata, calculates rates of change and change magnitudes and ensures that these values are within the depth budget to be used.

If the resulting depth change lies within the predetermined envelope, then the image pair is passed on unchanged to the output interface 160. If this is done repeatedly one after the other, the result is a video sequence.

If the depth change is not contained within the predefined depth budget, an image is adjusted, for example blurred, smudged, obscured, desaturated and/or masked/marked. The bigger the deviation from the depth budget, then the bigger the correcting operation of the blurring, for example. The operation can also include a blackflash or whiteflash, in other words a transition from black or white to some new image content. Instead of processing only a defined area, the entire image of the image pair can be blurred. This is particularly advantageous when no exact disparity map is available.

The method is repeated for each stereoscopic image pair. 

What is claimed is:
 1. A method for producing a motion video recording, comprising: obtaining a first image captured with a first camera rig having at least two cameras; obtaining a second image captured with a second camera rig; calculating, with an image processor, a first displacement of at least one first image point contained in a first sub-frame obtained from a first camera of the first camera rig relative to at least one second image point contained in a second sub-frame obtained from a second camera of the first camera rig and corresponding to the at least one first image point; storing the at least one displacement in a disparity table; and processing the at least one second image using the disparity table.
 2. The method of claim 1, further comprising displacing the second sub-frame relative to the first sub-frame, and wherein the first sub-frame and a fourth sub-frame obtained from a second camera of the second camera rig form the second image.
 3. The method of claim 2, wherein the second sub-frame is horizontally displaced.
 4. The method of claim 2, wherein the processing the at least one second image is performed simultaneously with displacing the second sub-frame relative to the first sub-frame.
 5. The method of claim 2, wherein the processing the at least one second image is performed after displacing the second sub-frame relative to the first sub-frame.
 6. The method of claim 1, further comprising displacing a fourth sub-frame obtained from a second camera of the second camera rig relative to a third sub-frame obtained from a first camera of the second camera rig to create a second displacement.
 7. The method of claim 6, wherein the second displacement is substantially the same as the first displacement.
 8. The method of claim 1, further comprising changing the magnification of the at least one second image in response to the disparity table and until a second depth distance between a second foreground point in a third sub-frame of the second image and a second background point in a fourth sub-frame of the second image is substantially same as a first depth distance between a first foreground point in the first sub-frame and a first background point in the second sub-frame.
 9. The method of claim 8, wherein the magnification is changed digitally.
 10. The method of claim 8, wherein the magnification of both the third sub-frame and the fourth sub-frame is changed.
 11. The method of claim 1, further comprising blurring all areas of the second image that lie outside a depth budget of the disparity table.
 12. The method of claim 11, wherein the blurring is accomplished using a Gaussian blurring algorithm.
 13. The method of claim 1, wherein the displacement corresponds to a depth of the first image.
 14. A system configured to for producing a motion video recording, the system comprising: an image processor configured to: receive a first image comprising at least a first sub-frame obtained from a first camera of a first camera rig and a second sub-frame obtained from a second camera of a first camera rig; receive a second image comprising at least a third sub-frame obtained from a first camera of a second camera rig and a fourth sub-frame from a second camera of a second camera rig; calculate at least one first displacement between at least one first image point in the first sub-frame and at least one corresponding second image point in the second sub-frame; store the at least one first displacement in a disparity table; and process at least one of the third sub-frame and the fourth sub-frame in response to the disparity table.
 15. The system of claim 14, wherein the image processor is further configured to displace the second sub-frame relative to the first sub-frame, and wherein at least the first sub-frame and the fourth sub-frame form the second image.
 16. The system of claim 14, wherein the image processor is further configured to displace the fourth sub-frame relative to the third sub-frame to create at least one second displacement.
 17. The system of claim 14, wherein the image processor is further configured to change the magnification of the second image in response to the at least one first displacement until a second depth distance between a second foreground point in the third sub-frame and a second background point in the fourth sub-frame is substantially the same as a first depth distance between a first foreground point in the first sub-frame and a first background point in the second sub-frame.
 18. The system of claim 14, wherein the image processor is further configured to blur all areas of the second image that lie outside a depth budget of the disparity table. 